Shape plan mode around microscope context hierarchy#32
Conversation
|
the latest comments from #23 apply here. Need a way to streamline plan generation itself - it currently takes a long time to make a plan. the tool calls might not be the most effectively organized to create the campaigh - phase - task structure, and the class of tasks - bench, imaging, genetics, etc,... there is a question of structure that constraints, versus structures that enrich the experience for a biologist looking for discovery - at the same time, we have microscope support, so need to maximize that supported integration. Need design that takes care of all of this - and is aesthetic. |
|
I posted the empirical setup/preflight status in the #23 thread: #23 (comment) That run could not complete a true plan-quality benchmark because this environment lacks Your point here is consistent with what the preflight suggests: the next design iteration should focus on streamlining plan generation itself, not just adding more fields. The campaign/phase/task structure should probably be created in fewer, more deliberate operations, with enough structure to support microscope integration while still feeling like a discovery-oriented planning surface for the biologist. |
|
I replied on #23 with the empirical setup status. Relevant to this PR: the benchmark preflight found a web UI rendering blocker, fixed here in commit I agree with your design concern. The next iteration should probably focus less on adding more fields and more on streamlining the generation operation itself: one coherent campaign/phase/task construction path, task classes that enrich rather than constrain, and microscope-support context kept explicit. #32 gives the context substrate; the next PR should likely address the planner workflow/UX around that substrate. |
|
Clarifying my earlier wording because I mixed two separate things. The missing Those blockers do not prevent a follow-up implementation pass on #32. The remaining implementation/design work is still real and can start from code review and tests:
So the clean split is:
I am going to implement what I can from the workflow side now: reduce the number of separate tool calls needed to create a structured campaign/phase/task plan, while leaving the empirical benchmark as a separate blocked validation step. |
|
Follow-up implemented from the clarified #32 thread in commit What changed:
Why this addresses the implementation side of your comment:
What remains unvalidated:
Verification:
|
|
Follow-up implemented for the proposed benchmark path in commit What changed:
This is deliberately not a claim about real model latency, browser UX, or generated-plan quality. It is an offline regression check for the streamlined structured-plan tool path while the empirical benchmark remains blocked by missing model/browser/runtime dependencies in this environment. Verification:
|
Context
This PR takes in the context from pskeshu's PR #23 reply and the Kesavan/Nordenfelt smart-microscopy framework (Journal of Microscopy 2026, doi: 10.1111/jmi.70063). It maps the technical/experimental/theoretical/conceptual hierarchy into Gently plan mode so the planning layer can become the biologist-facing interaction layer over DiSPIM, C. elegans embryos, and later other experiment modalities.
This is intentionally one PR for iteration, per the request in the PR #23 thread.
Changes
PlanContexton plan items withtechnical,experimental,theoretical,conceptual,sample_entity,operator_context,constraints, andsuccess_question.plan_contextthrough the SQLite context store, file-backed context store, templates, and plan restoration paths.create_plan_itemandupdate_plan_itemaccept context hierarchy data and render it in plan review/export output.Verification
python -m py_compile gently/harness/memory/model.py gently/harness/memory/_plans.py gently/harness/memory/store.py gently/harness/memory/file_store.py gently/harness/plan_mode/prompt.py gently/harness/plan_mode/tools/planning.py gently/harness/plan_mode/tools/validation.py tests/test_context_store.py tests/test_plan_context_validation.pypython -m pytest tests/test_context_store.py tests/test_plan_context_validation.py -q -p no:cacheprovidergit diff --check